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PRODUCTION OF COMPLEX CARBOHYDRATES 

5 CLAIM OF PRIORITY 

This application claims priority under 35 U.S.C. 1 19(e) from U.S. . 
Provisional Application Serial No. 60/134,756, filed May 18, 1999, which 
^plication is incorporated herein by reference. 

GOVERNMENTAL RIGHTS 
10 The United States Government retains certain rights in this inventioa 

Financial support was provided by the National Institute of Allergy and 
Infectious Diseases under Grant Number A 1240 16 and from the NIH National 
Center for Research Resources under Grants Number RR01614 and RR041 12, 

FIELD OF THE INVENTION 
15 This invention relates to a method for the production of complex 

carbohydrates"on-m-LPS-backbone-stnicture-in-Gram=negative-bacteria. 

BACKGROUND OF THE INVENTION 
Complex carbohydrates occur in nature and are involved in a wide array 
of biological fiinctions, including viral, bacterial and fungal pathogenesis, 
20 cell-to-cell and intracellular recognition, binding of hormones and pathogens to 
cell-surface receptors and in antigen-antibody recognitioa The term "complex 
carbohydrates" embraces a wide array of chemical compounds having the 
general formula (CH20)„ where the monomer unit is selected from any of 
thousands of naturally occurring or synthetic monomers, including, but not 
25 limited to, glucose, galactose, mannose, fucose and sialic acid. Saccharides may 
have additional constituents such as amino, sulfate or phosphate groups, in 
addition to the carbon-hydrogen-oxygen core. The polymer consisting of two to 
ten saccharide units is termed an oligosaccharide (OS) and that consisting of 
more than ten saccharide units is termed a polysaccharide (PS). These 
30 monosaccharide building blocks can be linked in at least 10 different ways, 
leading to an astronomical number of different combinations and permutations. 
It is found that strains within species and even tissue within an organism differ in 
complex carbohydrate structure. This high degree of variability, the highly 
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specific composition of naturally occurring complex carbohydrates and the wide 
range of biological roles make these compounds especially significant. 

Gram-negative bacteria contain complex carbohydrates, which are linked 
to lipids to form lipooligosaccharides (LOS) or lipopolysaccharides (LPS.) The 
5 immimogenicity of the LOS and LPS resides in the carbohydrate moiety, while 
pathogenicity resides in the lipid moiety. For this reason, OS and PS are useful 
as vaccines against Gram-negative pathogens and for identification of gram- 
negative bacteria. 

United States Patent Application 5,736,533 discloses oligosaccharides 

10 useful as therapeutic agents against pathogens that are the causative agents of 
respiratory infections. It is believed that pathogenic bacteria are able to colonize 
tissue by binding to carbohydrates on the surface of the tissue and that providing 
an excess of specific soluble oUgosaccharides can result in competitive inhibition 
of bacterial colonization. 

15 OS and PS firom LPS and LOS can be produced by growing the specific 

bacterial pathogen in culture, with subsequent cleava g e of the lipid moiety and 
purificatioa However, most pathogenic bacteria are fastidious in their growth 
requirements and slow growing, making this mode of production impractical. 
For example, Haemophilus influenzae is known to require a carbon dioxide 

20 atmosphere and brain/heart extract for growth. Helicobacter pylori grows very 
poorly in broth cultures required for OS and PS productioa In addition, many of 
these bacterial pathogens (for example. Neisseria meningitidis) can be dangerous 
to grow in large volimies because of the risk of aerosol and possible infection 
spread The ability to produce the OS and PS structures of fastidious bacterial 

25 pathogens in bacterial strains such as Escherichia coli and Salmonella minnesota 
which grow rapidly to high density offers a rapid way to produce these OS and 
PS fi-om fastidious bacterial pathogens. 

Eucaryotic proteins and peptides fi-equently have carbohydrate moieties 
on their surfaces, which act as specific binding sites for hormones, which are 

30 also glycosylated, that is, have complex carbohydrates linked to the peptide 
stmcture. Moreover, in addition to the recognition role, carbohydrates are 
necessary to the proper three-dimensional folding of polypeptides into functional 
glycoproteins. Bacteria do not glycosylate peptides and proteins efficiently or in 
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a manner equivalent to that of eucaryotes. For that reason, although bacteria are 
widely used as production cells for growing eucaryotic peptides and proteins, 
such useful human glycopeptides such as erythropoetin are grown in mammalian 
cells. United States Patent Number 4,703,008 discloses a method for the 
5 production of erythropoietin, in which cells such as Chinese hamster ovary cells 
are transfected with the DNA coding for the hormone and grown under a carbon 
dioxide atmosphere in complex medium. The resulting hormone is sufficiently 
similar to the naturally occurring hormone to be effective as a therapeutic for 
human use. 

1 0 An additional utility for isolated, cell-specific carbohydrates is for 

competitive inhibition of disease agents in which infection is reliant on surface- 
recognition glycosylated proteins. For example, the human immunodeficiency 
virus is known to bind to the surface receptor on T-4 lymphocytes. If an excess 
of firee T-4 receptor carbohydrate is present in the bodily fluids of the patient, the 

15 virus will bind to the fi-ee carbohydrate and is effectively prevented fi-om 

infecting the T-4 lymphocyte. 

Competitive inhibition of binding of antibodies to cell surfaces by 
administration of cell-recognition molecules may have therapeutic potential in 
the treatment of autoimmune diseases such as lupus erythematosus, multiple 

20 sclerosis and rheumatoid arthritis. Such moledules may bind to the cell receptor, 
blocking the binding of the automimmunie antibodies which cause the 
degeneration seen in such disease states. 

United States Patent Number 4 J45, 051 discloses a method for 
expressing DNA in an insect cell, a method that has practical application for the 

25 production of glycosylated peptides and proteins. However, the glycosylation 
resulting is that native to the insect, consisting of higher levels of maimose than 
are typical of mammalian cells. 

Practical production of peptides and polypeptides in bacterial production 
cells is well established. Chemical and enzymatic means for glycosylating 

30 peptides and proteins are well known in the art For example. United States 
Patent 5,370,872 discloses a method for coupling PS through a carboxyl or 
hydroxy 1 group to a protein. Classic organic syntheses of complex 
carbohydrates have long been known, but with limited practical application In 

3 
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addition to the difficulties inherent in the complexity of the glycopolymer 
molecule, many glycosidic bonds are labile and must be protected and 
deprotected during chemical synthesis, adding to the difficulty of synthesis and 
reducing the yield of product. 
5 Because of the drawbacks of organic synthesis, enzymatic synthesis has 

been devised. It is known that glycosylation proceeds by the step-wise addition 
of monomers through the action of such enzymes as glycotransferases. The 
reaction products can be further modified by lyases, acetylases, sulftases, 
phosphorylases, kinases, epimerases, methylases, transferases and the like. 

10 United States Patent Number 5,308,460 discloses such a step-wise synthesis on 
an immobilized matrix. 

A need remains for a more efficient and practical method for the 
production of complex carbohydrates, and glycoproteins and glycopeptides 
containing complex carbohydrates specific to a species or tissue. 

15 SUMMARY OF THE INVENTION 

The present invention is directed to the production of complex 

carbohydrates in a production cell. It is here disclosed that certain bacteria, such 
as Escherichia coli Strain K-12, have a core liposaccharide with a terminal 
heptose. A suitable production cell also contains an enzyme which catalyzes the 

20 transfer to the terminal heptose of an acceptor molecule, such as N- 

acetylglucosamine, to form a "scaffold" upon which glycotransferases add other 
saccharide monomers to form complex carbohydrates. If an otherwise suitable 
production ceU lacks such an enzyme, the DNA encoding the gene rfe (UDP- 
GlcNAc;Undecaprenol GlcNAc-1 phosphate transferase) of Haemophilus 

25 influenzae may be inserted into the production cell. Preferably, production of rfe 
is enhanced by the presence of the gene products of the gene IsgG. By inserting 
genes encoding glycotransferases into the production cell, the complex 
carbohydrates specific to bacteria such as Haemophilus influenzae^ Neisseria 
spp. Salmonella spp and Escherichia coli are produced. Mammalian complex 

30 carbohydrate such as polysialyl can also be produced. 

Accordingly, the invention provides a process for the production of a 
complex carbohydrate which comprises the steps of: (a) inoculating transformed 
production cells into a culture medium capable of supporting the growth of said 
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production cells wherein said production cells are prepared by transforming 
bacteria comprising (i) a core lipid structure containing a terminal heptose 
molecule and (ii) an enzyme capable of adding an acceptor molecule to said 
heptose molecule by inserting an isolated DNA sequence encoding 
5 glycotransferase synthesizes a complex carbohydrate into said bacteria to yield 
transformed production cells; (b) allowing growth of said transformed 
production cells; and (c) recovering said complex carbohydrate from the culture 
mediiun. 

The invention also provides a process for the production of an 

10 oligosaccharide which coniprises the steps of: (a) transforming gram-negative 
bacteria comprising (i) a core lipid strucmre containing a terminal heptose and 
(ii) an enzyme that adds a galactose molecule to said heptose wherein said 
transformed gram-negative bacteria are prepared by constructing a vector 
comprising an isolated DNA sequence coding for a glycotransferase that 

1 5 synthesizes an oligosaccharide; (b) inoculating said transformed gram-negative 

b a cteria into a culture medium capable of supporting the growth of said 

transformed bacteria; (c) allowing growth of said inoculated gram-negative 
bacteria; and (d) recovering said oligosaccharide from the culture medium. 

Using methods disclosed in this application, a production cell suitable for 

20 the practical production of other complex carbohydrates can be identified. Such 
a suitable production cell will have an acceptor molecule specific to the 
carbohydrate to be synthesized, or a site that can be modified to add such a 
specific acceptor molecule. The production cell will contain the initiating IsgG 
or hgF to form the appropriate acceptor. DNA coding for the glycotransferases 

25 of other species, strains, tissues, hormones, receptors or other cell-surface 

carbohydrates can then be inserted into such a production cell, with the resultant 
production of oligosaccharides or polysaccharide specific to that species, strain, 
tissue, hormone, receptor or other cell-surface carbohydrate. The nucleotide 
sequences for the genes rfc and IsG are on file in the H. influenzae Rd database 

30 available from TIGR (Bethesda, MD). Sequences for glycotransferases are 
available from die references herein disclosed. 

Also provided are methods for separating and purifying the product. 
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The invention also provides a process for the production of a complex 
carbohydrate, comprising culturing production cells comprising a chimeric DNA 
sequence encoding a glycotransferase, so as to yield production cells comprising 
an altered level of complex carbohydrate, wherein the production cells are 
5 bacteria comprising a core lipid structure containing a terminal heptose molecule 
and encoding an enzyme capable of adding an acceptor molecule to the heptose 
molecule. The invention also provides a process further comprising recovering 
the complex carbohydrate. 

BRIRF D FS CR TPTT Q N Q F T HE DRAW rNGS 
10 Figure 1 : The Isg region of Haemophilus influenzae DNA, 

(A) Diagram of the eight orfs. (B) Locations of m-Tn3(Cm) insertion sites (6). 
(C) Restriction maps of the EMBLOS-1 subclones that modified the coli JM 
109 LPS, 

Figure 2 : SDS-PAGE of the LPS from E, coli strain JM 1 09 (designated 

15 pGEM) and the three chimeric strains, pGEMLOS-7, pGEMLOS-5, and 

pGEMLOS-4. The LPS ran g e in molecular weight fix)m -3.3 to 5,5 kPa. 

Figure 3: Proposed structures of the chimeric oligosaccharides. Only 
the complete E. coli K-12 core structure containing a fourth heptose on the 
terminus of the oligosaccharide branch undergoes modification. Additional 

20 saccharides (designated R) are added to the 7-position of this heptose to form the 
chimeric oligosaccharides. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention provides a method by which the terminal heptose 
of the core structure of any gram bacterial species which contains the gene rfe 

25 (UDP-GlcNAc:Undecaprenol GlcNAc-1 phosphate transferase) (Alexander et al. 
(1994) J. BacterioL, 176-7079-7084) can be modified so as to act as an acceptor 
for oligosaccharide synthesis. The ffe gene encodes for a protein which 
catalyzes the transfer of N-acetyl glucosamine (GlcNAc, an "acceptor 
molecule") onto the carrier lipid undecaprenol phosphate. The regulation of this 

30 gene can be controlled with a regulator)' gene, IsgGy identified in Haemophilus 
influenzae. The increase in rfe expression caused by IsgG mediates the 
deposition of a GlcNAc residue on the terminal heptose of LPS and LOS firom a 
variety of gram-negative bacterial species including £. coli. Salmonella 
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minnesota and H, influenzae. This GlcNAC has been found to function as an 
acceptor molecule forming a scaffold for the sequential addition of saccharide 
monomers, under the direction of glycotransferases. For example, the 
galactosyltransferase gene, IsgF, results in the addition of a galactose to the 
5 GlcNAc. The gene sequence coding for the Isg glycotransferases of//. 

influenzae has been inserted into an Escherichia coli K-12 strain production cell, 
with the resultant production of H. influenzae-spoci^c LOS epitopes in E. coli. 

Any production cell containing an initiating enzyme similar to rfe can 
add an appropriate acceptor to form the scaffold. The production will preferably 
10 also contain the regulatory gene IsgG, DNA coding for the glycotransferases of 
other species, strains, tissues, hormones, receptors or other cell-surface 
carbohydrates can then be inserted into such a production cell, with the resultant 
production of oligosaccharides or polysaccharidea specific to that species, strain, 
tissue, hormone, receptor or other cell-surface carbohydrate. 

15 

Definitions: 

Complex carbohydrates: any polymer of formula (CH^OX, where n equals at 
least three monomer units, including polymers with additional substituents 
including but not limited to SO4, PO4, CO4, CH3, NH4, and such polymers linked 
20 to lipids, peptides and proteins. 

Production cell: A production cell useful in this invention is defined as any 
bacterium which contains an LPS or LOS-saccharide inner core 
terminating in a molecule and containing an enzyme enable of adding an 

25 acceptor molecule to the terminal molecule to serve as a scaffold for 

elongation and which can be transformed with exogenous DNA coding 
for glycosyltransferases. Cells that are otherwise suitable but lack the 
proper acceptor molecule may be used as production cells if they are co- 
transformed with genes such as rfe and IsgG^ to appropriately modify the 

30 LPS or LOS to function as an acceptor molecule for the formation of a 

scaffold. 
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Hib production cell: the preferred cell for production of//, influenzae type B- 
specific OS is preferably a gram-negati\ e bacteriuim, most preferably £. coli K- 
12 strain JM 109. 

Synthetic gene(s)\ the DNA coding for the enzyrhe or enzymes that synthesize 
the desired complex carbohydrate. These genes include those coding for 
glycotransferases, lyases, acetylases, sulfatases, phosphorylases, kinases, . 
epimerases, methylases and the like. 

Example 1'. Selection of a Hib production cell. 

Capsular strains of Haemophilus influenzae type b (Hib) are responsible 
for various invasive and bacteraemic infections in humans, including meningitis 
and pneumonia The surface lipoohgosaccharides (LOS) of Hib are known to be 
important factors in microbial virulence and pathogenesis. Structural studies of 
Hib LOS from wild-type and mutant strains have shown that the LOS contains a 
conserved heptose trisaccharide core which can be extended with additional 
sugars on each heptose. Recently, a re\nsed structure of the £. coli K- 12 core 
region was reported which also contains a heptose trisaccharide iimer core and a 
fourth heptose present on the terminus of the main oligosaccharide branch: 
20 Galal Hepa 1 

1 .1 
6 7 

Hepal - 6Glcal - 2Glcal - -SGlcal - 3Hepal - 3Hepal - 5Kdo 

25 Previous work showed that the core region of E. coli transformed with 

synthetic enzyme genes could be elongated by the addition of saccharide 
monomers under the direction ofH. influenzae genes to produce a modified E. 
coli LPS that was elongated by approximately five monomer units. It was 
thought that the monomers were added at each of the heptoses. (Kwaik et al., 

30 Molecular Microbiology, 5:2475-2480 (1990).) Therefore, efforts were made to 
transform an E. coli K-12 strain termed JM109 with K influenzae synthesis 
genes in an attempt to determine whether an LOS substantially identical to that 
of K influenzae could be produced. 

8 
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Escherichia coli strains were routinely cultured at 37*^ C using LB agar or 
broth with appropriate antibiotics. Vectors used in these studies were previously 
described. (Kwaik et al. (1990)). LPS from E. coli JM 109 was prepared by the 
extraction procedure of Darveau and Hancock (Darveau et al. J. Bacteriol. 
5 155(2), 831-838 (1983).) The LPS was separated by SDS-PAGE in resolving 
gels containing 15 % acrylamide, and visualized by silver staining. 

To determine the structure of this chimeric LPS, a few milligrams of LPS 
from each sample were treated with anhydrous hydrazine at 37^ C for 20 
minutes, and then precipitated with cold acetone. 
10 In order to establish the chemical structure of the E. coli core and 

determine the E, coli acceptor residue, the LPS from E. coli strain JM 109 was 
partially characterized using composition analysis, linkage analysis, and mass 
spectrometry as described in Example 6 below. 

15 2. Isolation of the LQS synthetic genes from Hib. 

Hib strain A-2 was ori g inally isolated from the spinal fluid of a child 

with meningitis. Hib A-2 was grown on chocolate agar supplemented with 
amino acids and vitamins or brain heart infiision agar supplemented with 4% 
Fildes reagent (Difco Laboratories, Detroit) at 35° C in 5% COj atmosphere. 

20 A gene cluster from Hib strain A-2 containing LOS synthesis genes {Isg) 

was previously cloned (Kwaik etal, (1990). The fag loci are contained within a 
7.4 kb DNA fragment, consisting of seven complete open reading frames (orfs). 
This region is one of several distinct loci also found in the genome sequence of 
Hib strain Rd which has been associated with lipopolysaccharide (LPS) 

25 biosynthesis. 

DNA from the Isg region of Hib strain A-2 was used to construct a 
genomic library in the lambda bacteriophage EMBL3 (Kwaik et al, 1990)). 
Twenty six phage clones were prepared which expressed Hib LOS 
oligosaccharide epitopes in E. coli strain LE392. The phage transformant 

30 designated EMBLOS- 1 produced a chimeric LPS with a 1 .4 kDa oligosaccharide 
added to the 4.1 kDa LPS of £. coli LE392. Monoclonal antibody (MAb) 6E4, 
which recognizes two components in the Hib A2 LOS mixture, also recognized 
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the novel 5.5 kDa component in the chimeric LPS, indicating some 
immunochemical similarity to Hib LOS. 

Exam ple 3. Transform a t io n of the Hib production cell. 
5 Restriction jfragments of EMBLOS-I were used to make a series of 

plasmids which modified E. coli strain JM 109 to give clones which produced a 
proposed chimeric series of higher mass LPS species. The transformants termed 
pGEMLOS-4, pGEMLOS-5, and pGEMLOS-7 generated modified or chimeric 
LPS of 5.5, 5.1, and 4.5 kDa, respectively. All three apparently modified the 4.1 

10 kDa LPS species from E, coli, although only the LPS from pGEMLOS-4 
expressed the 6E4 epitope. The LPS from strain pGEMLOS-5 was found to 
react positively with MAb 3F 1 1, suggesting the presence of terminal 
N-acetyllactosamine. The epitope recognized by MAb 6E4 is also present in the 
LOS of H. influenzae nontypable strain 2019, as well as the LPS from 

1 5 Salmonella minnesota Re mutant Binding of this monoclonal antibody to H. 
influenzae LOS can be inhibited by Kdo and the Kdo trisaccharide from the Re 
mutant of S. minnesota. Because the 6E4 epitope has been associated with the 
core of Haemophilus LOS, it was originally proposed that the chimeric structures 
expressed in E, coli might arise from the addition of a Haemophilus core 

20 structure to an acceptor residue of the £. coli 4.1 kDa LPS species. 

The Hib production cell was transformed with the plasmid pGEM3Zf+ 
into which different DNA restriction fragments from H influenzae strain A-2 bg 
locus had been ligated (see Table 1 and Figure 1). The plasmid pGEMLOS-4 
contained a 7.4 kb bamhl-pstl fragment of DNA which contained all seven open 

25 reading frames (A-G) comprising the Isg locus. The plasmid pGEMLOS-5 
contained a 5.5 kb Hind lll-pstl fragment of DNA comprising 5 open reading 
frames (C-G) of the Isg. The plasmid pGEMLOS-7 contained a 2.8 kb sphl-pstl 
fiagmoit of DNA comprising 2 open reading frames (F-G) of the Isg locus. The 
plasmid pGEM3zf+ without an insert was also transformed into strain JM 109. 

30 This strain and the LPS isolated from it were termed PGEM. 
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Example 4 . I solation and Pu rification of Q ligo s a ccharides. 

. The LPS from PGEM (3 1 mg), pGEMLOS-4 (25 mg), pGEMLOS-5 (15 
mg), and pGEMLOS-7 (4.4 mg) was hydrolyzed in 1% acetic acid (2 mg 
LPS/ml) for 2 hours at 100° C. The hydrolysates were centrifiiged at 5000g for 
5 20 min at 4° C and the supematants removed. The pellets were washed with 2 
ml of HjO and centrifiiged again (5000g, 20 min, 4'' C), The supematants and 
washings were pooled and lyophilized to give the oligosaccharide fractions. As 
a standard, 10 mg of LPS fi-om Salmonella typhimurium TT 1 19 Ra mutant 
(Sigma, St. Louis) was treated in the same fashion. 

10 To prepare desalted oligosaccharide pools for ESI-MS analysis, small 

aliquots of the crude oligosaccharide firactions (<2 mg) were chromatographed 
on two Bio-Select SEC 125-5 HPLC columns (Bio-Rad, Richmond, CA) 
connected in series, using 0.05 M pyridinium acetate (pH 5.2) at a flow rate of 
Iml/mia A refractive index detector was used to monitor colxunn efiQuent and 

1 5 chromatograms were recorded and stored with an integrator. 

For large scale separations, the oligosaccharide fractions from PGEM 
(10.2 mg), pGEMLOS-4 (9.3 mg), and pGEMLOS-5 (7.0 mg) were dissolved in 
0.3 ml of 0.05 M pyridinium acetate buffer (pH 5.2) and centrifuge-filtered 
through a 0.45 gm Nylon-66 membrane. The PGEM and pGEMLOS-4 samples 

20 were ^plied to a single Bio-Gel P-4 column (1.6 x 84 cm, <400 mesh; 

Bio-Rad), and the pGEMLOS-5 sample was applied to two Bio-Gel P-4 colunnuis 
connected in series (1.6 x 79 cm and 1.6 x 76.5 cm). The columns were 
equipped with water jackets maintained at 30° C. Upward elution at a flow rate 
of 10 ml/h was achieved with a P-1 peristaltic pump (Pharmacia, Piscataway). 

25 Effluent was monitored with refractive index and fractions were collected at 10 
minute intervals and evaporated to dryness in a concentrator. 

Example 5. Dephosphorylation of Oligosaccharides. 

Oligosaccharide fractions were placed in 1.5 ml polypropylene tubes and 
30 treated with cold 48% aqueous hydrogen fluoride to make 5-10 Mg/ml solutions. 
Samples were kept for 18 hours at 4° C and then aqueous HF was evaporated. 
The dephosphorylated samples were then rechromatographed on two Bio-Select 
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SEC 125-5 HPLC columns connected in series using 0.05 M pyridinium acetate 
(pH 5.2). 

Ryampie 6. Characterization of product. 
5 The reactivity with monoclonal antibodies raised to the naturally 

occurring Hib LOS, as shown in Example 2, indicated that the product had the 
same immunochemcial function as the naturally occurring Hib LOS. The 
samples were further analyzed by different techniques in order to determine 
structural identity to the desired complex carbohydrate. 

10 

a. Monosaccharide Composition Analysis. 

Dephosphorylated oligosaccharide fractions were dissolved in 400 filof 

2 M trifluoToacetic acid and heated for 4 hours at 100^ C. The hydrolysates were 

evaporated to dryness in a Speed-Vac concentrator, redissolved in 20 a^I HjO, 
15 and dried again Hydrolysates were analyzed by high-performance anion 
exchangeckromatogr^hy^with-pulsed-amperometrie-detection-using-arDiottex — 

BioLC system (Dionex, Suimyvale, CA) with a CaiboPac PAl column. 



b. Methylation Analysis. 
20 Linkage analysis was performed on dephosphorylated oligosaccharide 

fractions using the microscale method modified for use with powdered NaOH. 
Partially methylated alditol acetates were analyzed by GC/MS in the El and CI 
modes on a mass spectrometer. 

25 c T jquiH Semndary Ton Mass Spectrometry (LSIMS). 

LSIMS was performed using a mass spectrometer with a cesium ion 
source. Oligosaccharide samples (in 1 ^1 H20)were added to 1 jj\ of 
glycerol/thioglycerol (1; 1) on a stainless steel probe tip. A Cs+ ion primary 
beam energy of 10 keV was used and the secondary sample ions were 

30 accelerated to 8 keV. Scans were taken in the negative-ion mode at 300 s/decade 
and recorded with an electrostatic recorder. The spectra were mass calibrated 
manually with Ultramark 1621 (PCR Research Chemicals, Inc., Gainesville, PL) 
to an accuracy of better than ± 0.2 Da. 
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d. Electrospray Ionization Mass Spectromettv^ (ESI-MS). 

Oligosaccharides and 0-deacylated LPS were analyzed on a mass 
spectrometer with an electrospray ion source operating in the negative-ion mode. 
Oligosaccharide samples were dissolved in H^O mixed with running solvent (1 
5 /il in 4 A^l), and injected into a stream of HjO/acetonitrile (1/1, v/v) containing 
1% acetic acid, at a flow rate of approx. 20 ^il/'mia Mass calibration was carried 
out with Csl in the negative-ion mode. 

In some cases, selected oligosaccharide fractions were analyzed at higher 
resolving power (M/A M = 2000) using a sector-orthogonal time of flight (TOP) 
10 instrument with an array detector operating under ESI conditions in the 
negative-ion mode. In this case, the solvent system and flow rate were 
essentially the same as described above for the quadnipole ESI experiments. A 
scan speed of 5 sec/decade was used for all samples over the m/z range of 500 to 
3000 with an accelerating voltage of 4 kV and an ESI needle voltage of between 
15 3.5-4 kV higher. Mass calibration was carried out with an external reference 

consisting-ofeshtaken-underUquid-secondaryion-mass-spectrometry"co^^^ 

followed by a one point correction of the doubly charged deprotonated molecular 
ion of the oligosaccharide Srom the LPS of Salmonella typhimurium Ra mutant 
((M-2H)^- exact = 973.2)) in the negative-ion ES-MS mode. 

20 

e. Matrix Assisted Ima ger Desnrptinn Tnnizatinn (MALDT) Mass SpftrtmniRtry 

O-Deacylated LPS samples were analyzed on a Voyager or an Elite 
MALDI-TOF instrument (PerSeptive Biosystems, Framingham, MA) equipped 
with a nitrogen laser (337 nM). All spectra were recorded in the negative-ion 

25 mode using delayed extraction conditions as described in detail elsewhere. 
(Gibson et al. J. Amer. Soc. Mass Spec. 8:645-658 (1997)). Samples were 
dissolved in HjO (approx. 250 pmol/^l), and mixed 1:1 with the matrix solution 
(a saturated solution of 2,5-dihydroxybenzoic acid in acetone) and allowed to dry 
at room temperature on a gold-plated MALDI plate. Approximately 100 laser 

30 shots were recorded for each sample, averaged and then mass calibrated using an 
external mass calibrant consisting of renin substrate tetradecapeptide, insulin 
chain B, oxidized, and bovine insulin (all from Sigma). For external calibrations 
under these conditions, a mass accuracy of 0. 1 % was obtained For comparison 
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purposes, a single point correction was made to the spectra of the O-deacylated 
LPS from PGEM using the expected lipid A fragment ion ((M-H) average = 
952.009), and then the spectra for the three chimeric strains were recalibrated . 
using this lipid A fragment ion and an additional ion from PGEM (m/z 2835.7). 
5 present in all four samples. 

f TgnHpm Ma<;5; Spp.rfmTnetry (MS/MS^ Using Quadrupole-TOF fqTQF). 

Dephosphorylated oligosaccharides were analyzed in the positive-ion 
mode on a mass spectrometer equipped with a nanospray ion source. The 

1 0 analyzer consists of a high pressure RF-only ion guide followed by a quadrupole 
mass filter. A high pressure quadrupole collision cell follows the first mass 
filter. The TOP mass analyzer is comprised of a reflection with an effective 
flight path of 2.5 meters. Samples were dissolved in HjO/acetonitrile (1/1, v/v) 
containing 1% acetic acid, and 2 /zl of each was injected into a nanospray tip. 

15 The nanospray needle voltage was typically 800-1000 V. One sample loading 

^usually-gave-an-anaIysis-time-of-30=40-ndn,JwMctLaUowedaxoTLvm 

spectrum to be obtained prior to the selection of several individxial ions for CID 
MS/MS. In MS mode the high resolution capability (8,000 FWHM) allowed 
unambiguous determination of the charge state for each ion. For CID-MS/MS 

20 operation the quadropole mass analyzer with a mass window of I m/z unit was 
used to select precursor ions for fragmentation, which in most cases were doubly 
charged QA+2Uf*, The selected ions were fragmented in a colHsion cell with air 
as the collision gas and analyzed in the orthogonal TOF operating at an 
accelerating potential of 20 kV. Fragment ion spectra were accumulated under 

25 computer control for periods of between 10 seconds and 1 minute. Mass 
assignments based on external calibration were generally within 50 ppm of 
calculated monoisotopic values whereas internal calibration gave masses 
accurate to +5 ppm. 

30 g SDS-PAGF Analysis of LPS. 

We have previously reported the transformation of £. coli strain JM 109 
(a K- 12 strain which produces rough LPS (r-LPS) which lack the 0-side-chain) 
with a series of plasmids containing overlapping restriction fragments of DNA 
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from the Isg region of//. inJLuenzae type b strain A-2 ( Kwaik et al, 1990). 
Partial LOS segments were produced. As diagrammed in Figure 1, the 
pGEMLOS-4 clone contains all of the complete orfs (orfs A-G) in the Isg region, 
whereas pGEMLOS-5 contains orfs C-G, and pGEMLOS-7 contains orfs F-G. 
5 The clones pGEMLOS-4, pGEMLOS-5, and pGEMLOS-7 were shown by 
SDS-PAGE to produce modified LPS structures which added 1.4, 1.0, and 0.4 
kDa moieties, respectively, to the 4.1 kDa E, coli core (Figure 2). 

h. Analysis of O-Deactylated T.PS hy MALDI-TOF. 

10 For initial screening of LPS molecular weights and heterogeneity by 

mass spectrometry, small aliquots of LPS from PGEM, pGEMLOS-4, 
pGEMLOS-5, and pGEMLOS-7 were treated with anhydrous hydrazine to 
remove 0-linked fatty acids from the lipid A moiety. The PGEM O-deacylated 
LPS sample contains several species in the range of 2738-3172 Da, representing 

15 the major E. coli core structures. When fit to proposed compositions, the 

obse^ved-species-were-found-to-exhibi^hete^ogCTeity-in-hq)tose-(Hep)^hexose — 

(Hex), 3-deoxy-D-mannooctulosonic acid (Kdo), phosphate (Phos), and 
phosphoethanolamine (PEA) (Table 2). Specifically, two main core types were 
observed containing either 3 Hex and 3 Hep (with 2 or 3 Kdos) or 4 Hex and 4 

20 Hep (with 2 Kdos), with variable amounts of phosphate and PEA in both. The 
pGEMLOS-7 O-deacylated LPS mixture contained many of these same species, 
in addition to two major new species at (M-H)- 3334.5 and 3456.8. The m/z 
3334.5 species apparently arises from the addition of Hex and 
N-acetylhexosamine (HexNAc) to the PGEM core structure containing 4 Hex, 4 

25 Hep, 2 Kdo, 2 Phos, and 1 O-deacylated diphosphorylated Lipid A (O-DPLA) 
moiety. A further addition of 1 PEA moiety gives the m/z 3456.8 species. 
These data suggest that the transformation producing pGEMLOS-7 results in the 
addition of a Hex-HexNAc moiety to the £. coli LPS. Likewise, the main 
species in the pGEMLOS-5 O-deacylated LPS (m/z 3700.6 and 3823.6) were 

30 found to arise from the addition of 2 Hex plus 2 HexNAc to the PGEM core 
structure containing 4 Hex, 4 Hep, 2 Kdo, 2 Phos, 1 O-DPLA, and 0 or 1 PEA 
(see Table 2). These structures are also found in the pGEMLOS-4 O-deacylated 
LPS, in addition to new species arising from the ftuther addition of either 
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another Hex (m/z 4083.2 and 4206.4) or HexNAc (m/z 4124.5 and 4246.8) to, in 
this case, the PGEM core structure containing 4 Hex, 4 Hep, 3 Kdo, 2 Phos, 1 
0-DPLA, and 0 or 1 PEA (see Table 2). Of the chimeric LPS structures, only 
these high molecular weight pGEMLOS-4 components contained the third Kdo 
5 moiety, 

i Analysis of Oligosaccharides by EST-MS and LSIMS. 

The LPS fix)m PGEM, pGEMLOS-4, pGEMLOS-5, and pGEMLOS-7 
were subjected to mild acid hydrolysis to liberate free oligosaccharides. 

10 Initially, small aliquots of the oligosaccharide fractions were desalted by size 
exclusion HPLC and analyzed as mixtures by negative-ion ESI-MS. The 
ESI-MS spectra contained predominantly doubly charged ions, (M-2H)^ In 
general, the data were consistent with results from the MALDI-TOF analysis of 
0-deacylated LPS. The PGEM sample was found to contain seven major 

1 5 oligosaccharides and several minor species, ranging in molecular weight from 

1459.3 to 2016.7 Da. As shown in Table 3, proposed compositions were 

determined for the various species which indicated that the structures consisted 
of two main core types; one containing 3 Hex, 3 Hep, and I Kdo, and another 
containing 4 Hex, 4 Hep, and IKdo. Variability in the number of phosphate and 

20 PEA groups was responsible for the large number of species present in the 
mixture. 

The pGEMLOS-4, pGEMLOS-5, and pGEMLOS-7 samples contained 
many of the species foimd in the PGEM sample, in addition to larger molecular 
weight oligosaccharides (Table 3). New LPS glycoforms of M, 2177.7 and 

25 2302.5 were observed in the pGEMLOS-7 sample, consistent with the addition 
of a single Hex and HexNAc residue to the PGEM core structure containing 4 
Hex, 4 Hep, 1 Kdo, 2 Phos, and 0 or 1 PEA The high molecular weight 
components of the pGEMLOS-5 sample (M^ 2543.9 and 2666.5) suggested the 
further addition of yet another Hex-HexNAc unit, and the pGEMLOS-4 sample 

30 contained even higher molecular weight materials (ranging from Mr 2706.1 to 
2870.0) consistent with the addition of one more Hex or HexNAc moiety. 

To aid in the determination of proposed compositions for these species, 
oligosaccharides from the PGEM, pGEMLOS-4, pGEMLOS-5, and 
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pGEMLOS-7 samples were sqjarated by size exclusion chromatography and 
firactions were analyzed by LSIMS and/br ESI-MS. Selected fractions 
representing the two major PGEM core types and the various chimeric structures 
were then pooled, dephosphorylated with aqueous HF, rechromatographed on 
5 size-exclusion HPLC, and analyzed again by negative-ion LSIMS or ESI-MS. 
Proposed compositions for the molecular ions observed after HF-treatment are 
listed in Table 4. Upon removal of phosphate and PEA moieties, the major high 
mass species present in the pGEMLOS-7 sample is an oligosaccharide of M^ 
(avg.) 2020.3 (IHexNAc, 5 Hex, 4 Hep, and IKdo). The pGEMLOS-5 sample 
10 contains an oligosaccharide of M, (avg.) 2386.3, resulting from the further 

addition of 1 Hex and IHexNAc to the pGEMLOS-7 LPS (2 HexNAc, 6 Hex, 4 
Hep, and 1 Kdo). This species is also present in the pGEMLOS-4 sample, in 
addition to higher molecular weight structures containing an additional Hex (M^ 
(avg.) 2548.4) or HexNAc (M, (avg.) 2589.5). 

15 

j.-MonQsaccharide-CoTTipo<?i'fif>Ti-?^Tid-T-inl<-?ige-Analyy;ftR- 

Mass spectrometric analyses of the free oligosaccharides from PGEM, 
pGEML0S4, pGEMLOS-5, and pGEMLOS-7 indicated that the different 
chimeric structures arise from additions of stoichiometric amoimts of hexose and 

20 HexNAc residues to a variably phosphorylated PGEM core structure containing 
4 Hex, 4 Hep, and IKdo. No chimeric structures were observed to contain the 3 
Hex, 3 Hep, and IKdo core. 

For comparison purposes, dephosphorylated oligosaccharide fractions 
containing the two PGEM core types and the main chimeric structures from 

25 pGEMLOS-4, pGEMLOS-5, and pGEMLOS-7 were hydrolyzed in 2N 
trifluoroacetic acid to determine their monosaccharide compositions, and 
therefore the identities of the Hex and HexNAc residues. When analyzed by 
high pH anion exchange chromatography with pulsed amperometric detection, 
the PGEM hydrolysates were found to contain only galactose, glucose, and 

30 L-glycero-D-manno-heptose (Table 5). (The Kdo residue is not recovered under 
these hydrolysis conditions.) The two core types were identified as GalGlCjHepj 
and GalGc3Hep4. The pGEMLOS-7 sample contained GlcNH2Gal2Glc3Hep4 
(Table 5), suggesting that the larger PGEM core was being modified by the 
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addition of one Gal and one N-acetylglucosamine (GlcNAc) residue. Likewise, 
the composition of the pGEMLOS-5 sample suggested that the larger PGEM 
core was being further glycosylated with only Gal and GlcNAc residues. 
Fraction 2 from pGEMLOS-4, which contained the same species as 
5 pGEMLOS-5, gave similar results, and fraction 1 from pGEMLOS-4, which 
contains three main species (see Table 4), contained slightly more GlcNH2. 

Aliquots of the same six dephosphorylated oligosaccharide fractions used 
for monosaccharide composition analysis were taken for methylation analysis to 
establish sugar linkage positions. The partially methylated alditol acetates 
10 observed by GC/MS are listed in Table 6. Again, by comparing the two PGEM 
core types, it is relatively straightforward to see that the second terminal heptose 
of the larger PGEM core is converted to a 1,7-linked heptose in all of the 
chimeric structures and thus must represent the linkage site for the novel 
glycosylation. Since no chimeric structures were observed with the Hcp^ core, it 
1 5 is most likely that the nonreducing teraainal heptose recently identified on the 

olig osaccharide bra nch^in_the.KrJ.2xQre,stnicto^ 

heptose. Additionally, no new trilinked saccharides were obtained from the 
chimeric oligosaccharides, suggesting that the sugars were most likely all added 
in a straight chain. 

20 

k- Sequencing of Chimeric OlignsarrhflriHeg hy iKAfi/Mfl 

To confirm the identity of the linkage site between the E, coli LPS core 
and the novel oligosaccharide moieties, and to determine the sequences of the 
added sugars, the dephosphorylated oligosaccharides were subjected to MS/MS 

25 analysis. For these experiments, samples were run in the positive-ion mode and 
doubly charged molecular ions, (M+2H)^% were selected for collision-induced 
dissociation (CID). Various reducing-terminal (Y-type) and non-reducing 
terminal (B-type) sequence ions are present in the spectra For the PGEM 
oligosaccharide, the Y ion series including the Y^,, (m/z 732.2 (2+)), Y„5, (m/z 

30 651.2 (2+)), and Y^^, (m/z 1 139.3) fragment ions, and the corresponding B ion 
series including the 83^, (m/z 517.2), B4„, (m/z 841.3). Bj (m/z 1225.4), and B^ 
(m/z 1417.4) fragment ions, support the published structure with the fourth 
heptose on the non-reducing terminus of the largest oligosaccharide branch. In 
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addition to these sequence ions, several ions present in the spectrum apparently 
arise from internal cleavages, which can occur under high energy CID 
conditions. In the spectrum of the pGEMLOS-5 oligosaccharide, two similar Y 
and B-type ion series clearly define the sequence and linkage site of the added 
5 tetrasaccharide. Intense B ions at m/z 366. 1 (B2„.') and 73 1.3 (84^,) arise from 
the sequential cleavage of two Hex-HexNAc moieties. These losses are also 
represented by the corresponding Y^^,' (ra/z 2020.6) and 7^, (m/z 1655.5) 
fragment ions. Fragment ions at m/z 923.3 (B j^JO and m/z 1463.4 (Y 
confirm that the Hex-HexNAc-Hex-HexNAc moiety is linked to a heptose, and 
10 additional cleavages fiirther along the large oligosaccharide branch confirm that 
the novel tetrasaccharide is attached to the largest branch of the PGEM core 
structure. 

In the MS/MS spectra of the chimeric oligosaccharides from 
pGEMLOS-7 and pGEMLOS-4, intense B ions also clearly defined the 

1 5 structures of the added sugar moieties. In the pGEMLOS-7 oligosaccharide (M^ 

2019.7)3 a B ion at m/z 366. 1 conresponds to a single Hex-HexNAc disaccharide 

moiety. The pGEMLOS-4 oligosaccharide of 2587.9 
(HexNAc3Hex6Hep4Kdo) lost a HexNAc-Hex-HexNAc firagment (m/z 569^) 
and a HexNAc-Hex-HexNAc-Hex-HexNAc fragment (m/z 934.3), whereas the 

20 pGEMLOS-4 oligosaccharide of M, 2546.8 (HexNAc2Hex7Hep4-do) lost a 
Hex-Hex-HexNAc (m/z 528.2) and a Hex-Hex-HexNAc-Hex-HexNAc (m/z 
893.3,) firagment In addition to those B-type ions, the latter spectrum also 
contained large ions at m/z 366.1 and 731.3, which apparently arise as internal 
fragments in that case. 

25 Assuming that the oligosaccharides are built up sequentially, i.e., from 

pGEMLOS-7 to pGEMLOS-5 to pGEMLOS-4, the MS/MS data, in combination 
with our methylation analysis results, allows the partial structures of the chimeric 
oligosaccharides to be deduced as shown in Figure 3. 

The structural data support the prediction that E. coli K- 12 transfonned 

30 with plasmids containing portions of an eight gene segment from H. influenzae 
involved in LOS biosynthesis makes chimeric LPS which can be modified to 
produce oligosaccharide essentially identical to that of K influenzae. Moreover, 
we have shown that the chimeric LPS are segregated hybrid-type structures. 
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where the £. coli R-LPS core structure is first synthesized and then serves as a 
scaffold for H. influenzae LOS biosynthesis enzymes to add a second 
independent set of sugars not found in the parent E. coli strain. Thus, the 
biosynthetic pathways appear to be sequential (segregated) and not intermixed. 
5 Before this invention was made, the role of the terminal branch heptose 

in the £. coli R-LPS as the acceptor for oligosaccharide elongation or the 
requirement for a funcitonal initiator enzyme was unknowa The published 
structure for the complete £. coli K- 12 core did not contain a second terminal 
heptose, but rather had this fourth heptose as part of the inner core region. The 

10 oligosaccharide branch was believed to terminate in glucose, which was 

proposed to be the acceptor site for 0-antigen and other substituents. The role of 
the initiator enzyme was unknown. It is now apparent that only E. coli R-LPS 
structures containing this fourth heptose (i.e., complete core structures) 
underwent elongation in the plasmid-transformed chimeric strains and thus, only 

15 those E, coli having this composition are useful as production cells for the 

production of H, influenzae. In the chimeric structures, GlcNAc is the first su g ar 
added to the seven position of this heptose. There are two possible explanations 
for this crucial first step in the elongation sequence. One, an 
N-acetylglucosamine-specific glycosyltransferase firom Haemophilus encoded in 

20 orfif or orfg either has this precise specificity or is promiscuous enough to allow 
this reaction to occur. Two, some analogous E, coli glycosyltransferase gene is 
being activated by a Haemophilus regulatory gene. Since sequence comparisons 
of the seven genes contained in this plasmid suggest that both 
glycosyltransferase and regulatory genes are present, both explanations are 

25 possible. However, the fact that terminal GlcNAc and 1,7- linked heptose have 
been found in non-stoichiometric amounts in some other strains of £. coli K- 12 
suggests that the addition of this sugar to the terminus of the oligosaccharide 
branch is accomplished by E, coli enzymes. It was recently reported that when 
mutations causing the rough phenotype in E. coli K- 12 are complemented, the 

30 complemented strains produce an O-aniigen which has GlcNAc at the reducing 
tenninus of the repeat imit Regardless of the mechanism of this first key step in 
the extension of the PGEM core, it appears that addition of this GlcNAc is 
rate-limiting, since a large percentage of unmodified PGEM core R-LPS remains 
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in the chimeric mixtures. Furthermore, very little or no intermediate structures 
are observed as one progresses from pGEMLOS-7 to pGEMLOS-5 and 
pGEMLOS-4, suggesting that once this first GlcNAc is added, the addition of 
the other Haemophilus-iclatQd sugars proceeds quickly to defined end points. If 
5 other steps in the biosynthesis of the chimeric LPS were as incomplete as the 
addition of this first GlcNAc, one would expect to see these other intermediate 
structures, yet none were observed. Therefore, it is likely that a N- 
acetylglucosaminyltransferase from E. coli that is regulated through the product 
of orff or orfg adds this first key sugar in the chimeric structures. Preliminary 
1 0 data on a chimeric construct containing orfg alone showed a mass shift of 203 
Da (HexNAc), suggesting that orfg encodes this regulatory gene. 

The second step in the biosynthesis of the chimeric LPS is the addition of 
galactose to the 3-position of the terminal GlcNAc. The resulting disaccharide, 
GalBGlcNAc, is the structural moiety observed in pGEMLOS-7, which arises 
15 when the transforming plasmid contains orfs F-G from Haemophilus, 

Exanrination of the predicted amino acid sequences of the gene products 

indicates that orff has high homology (66% identity) to a galactosyltransferase 
(osmE) from Erwinia amylovora^ suggesting that it may encode a 
galactosyltransferase in Haemophilus, OrfG does not show any homology to 
20 known oligosaccharide biosynthetic genes, but is homologous (64% identity) to 
a gene encoding the ModE protein in £. coli. This protein is involved in 
molybdenum transport and regulation of transcription, suggesting that oxfG may 
encode a regulatory protein bom Haemophilus (which may be regulating an 
iV-acetylglucosaminyltransferase gene from E. coli in the chimeric strains). 
25 In the pGEMLOS-5 strain, an additional three genes are contained in the 

transforming plasmid (orfs C-G) and an additional GlcNAc and Gal are observed 
in the resulting LPS. These sugars now define the tetrasaccharide 
Gall-4GIcNAc-3Gall3GlcNAc. The LPS from this transformant is now reactive 
to the 3F1 1 MAb, suggesting that this new disaccharide is betalinked to form the 
30 terminal trisaccharide, Gall4GlcNAcl-3GaL All of the new orfs contained in 
this plasmid have some homology with known glycosyltransferase genes: orfC 
has homology with the asmT> (26% identity) from Erwinia amylovora, which 
encodes a glycosyltransferase for exopoly saccharide synthesis, and TrsD (38% 
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identity) from Yersina entercolitica, a gene involved in LPS inner core synthesis, 
orfD has homology with the sialyltransferase gene {1st) (27% identity) from 
Neisseria gonorrhoeae^ and orfE has homology with a putative 
glycosyltransferase gene (77% identity) from Actinobacillus sp. and the 
5 galactosyltransferase gene, amsB (27% identity) from Erwinia amylovora. The 
fact that these three additional orfs in the transforming plasmid apparently result 
in the addition of only two more sugars to the growing oligosaccharide chain 
may indicate that the acceptor for one of the glycosyltransferases is absent in the 
chimeric LPS. 

10 When two more orfs are added in the transforming plasmid (orfs A-G) to 

form the pGEMLOS-4 chimeric strain, we observe that the 3F1 1 epitope 
(Usappears and the terminal Gal residue of the epitope is capped by either a 
second Gal or a GlcNAc moiety, apparently linked to the 6-position of the GaL 
These new species present in the pGEMLOS-4 LPS population were also 

15 observed to contain a third Kdo moiety, presumably somewhere in their core 

^regionsr-While-some-ofthe-incGmplete-eore-stnietures-found-in-the-vvild-^ 

coli K- 12 LPS populations also contain a third Kdo, of the chimeric structures, 
only the structures imique to pGEMLOS-4 were found to contain a third Kdo. 
This chimeric strain was also recognized by MAb 6E4 which recognizes an inner 

20 core, Kdo-related epitope in H. influenzae, suggesting that this third Kdo forms a 
different epitope than the one found in the core structure of the wild-type E, coli 
LPS. Thus, the addition of orfs A and B to the transforming plasmid which 
fon-ned strain pGEMLOS-4 seems to have multiple effects on the chimeric LPS 
structure. OrfB is homologous (46% identity) to the sialyltransferase gene from 

25 N. gonnorhea. OrfA is homologous to both the Rib X gene product (22 % 
identity) from E, coli and TrsA (24 % identity) of K entercolitica. These are 
putative O-antigen transporters (36,37), suggesting that orfA may encode a 
flippase. 

While sialyl-N-acetyllactosamine-containing structures are only minor 
30 components of the wild-type H. influenzae type b strain A2 LOS population, we 
have previously seen that Isg genes are involved in the synthesis of this epitope. 
Transposon mutagenesis of orfD produced mutant strain 281.25, that lost all 
ability to add galactose to Hib LOS glycoforms. This strain could not make any 
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of the wild-type LOS structures larger than the major species containing four 
glucoses and three heptoses. Mutation of orfE (which is downstream of orfD) 
produced strain 276.4 which had essentially the same defect, except for one 
important difference: strain 276.4 retained the ability to make the 
5 sialyl-N-acetyllactosamine epitope. These results suggest that in the transposon 
mutants, the knockout of orfD has a polar effect on orfE, which would imply that 
the gene product of orfE is a galactosyltransferase required for synthesis of the 
higher molecular weight wild-type structures containing terminal gaiactose(s) on 
their glucose disaccharide branches and the gene product of orfD is likely an 
10 A/^acetylglucosaminyltransferase required for the synthesis of the 

sialyl-N-acetyllactosamine epitope. The case for these assignments can be made 
on the basis of the homologies noted above (orfE is homologous to a 
galactosyltransferase gene) and the LOS glycoforms observed in the 276.4 and 
281.25 mutant strains. Since no truncated versions of the sialyl-N- 
15 acetyllactosamine structure were seen in the 276.4 LOS population (i.e.; no 
species lacking either sialic acid or sialic acid plus galactose), it seems probable 
that the orfd gene codes for the glycosyltransferase which adds the GlcNAc to 
the oligosaccharide branch. This is also consistent with the observation that one 
of the genes in orfd C-E is apparently responsible for adding GlcNAc to the 
20 3-position of the Gal which is terminal in the pGEMLOS-7 LPS structure. 

This chimeric carbohydrate expression system has provided infonnation 
that is relevant to unraveling the functions of these Isg genes and has the 
additional advantage of being carried out in the absence of the normal 
endogenous genetic background on K influenzae. Indeed, while gene knockouts 
25 of some of the Isg genes in i/. influenzae have been completed, downstream or 
regulatory gene effects can often complicate their fimctional analysis. In this E. 
coli expression system, structural analysis of the resulting chimeric LPS has 
shown that synthesis proceeded as a serial (non-parallel) synthesis, that is, the 
new elements of the chimeric LPS were added after the formation of the £. coli 
30 R-LPS. The fact that this synthesis was sequential (rather than interdigitated 
with the R-LPS synthesis, for example) allowed for the functions of these K 
influenzae gene products to be more readily delineated from the chimeric 
oligosaccharide structures. Moreover, screening of the chimeric LPS products 
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with monoclonal antibodies enabled us to follow the formation of terminal sugar 
sequences (epitopes) that are unique to the Haemophilus strain from which the 
plasmid DNA originated. 

All publications and patents cited herein are incorporated by reference as 
5 though fully set forth. 

This invention has been described with respect to specific examples and 
embodiments. However, it is understood that one skilled in the art may make 
variations or modifications that are within the spirit and scope of the invention. 
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Table 1. Bacterial Strains, LPS and Vectors 



Strain/ 
Plasmid 


Relevant characteristics 


Reference/source 


£. coli 








fee At supEp hsR, AflcLc^pro) 




H. influenzae 






A2 


Parental strain 


(10) 


Plasmid 






pGEM3Zf+ 




Promega Biotech 


pGEMLOS-4^ 


Ap^ contains 7.4 kb bamHl-pstl DNA 
H. influenzae isg locus 


This study 


pGEMLOS-5 


Ap^ contains 5.5 kb hindSl-pstl DNA 
//. influenzae Isg locus 


This study 


pGEMLOS-7 


Ap^ contains 2.8 kb sphl-pstl DNA 
//. influenzae Isg locus 


This study 


LPS 






pGEM 


isolated from strain JM109 
transformed with pGEM3zf+ 


This study 


pGEMLOS-4 


isolated from strain JM109 
transformed with the plasmid 
pGEMLOS-4 


This study 


pGEMLOS-5 


isolated from strain JM109 
transformed with the plasmid 
pGEMLOS-5 


This study 


pGEMLOS-7 


isolated from strain JM109 
transformed with the plasmid 
pGEMLOS-7 


This study 
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Table 4. Molecular weights (M^) and proposed compositions 
of the dephosphorylated ollg saccharides from 
pGEM, pGEMLOS-7, pGEMLOS-5, and pGEIVILOS-4" 



Fraction 


Obs. M, 


Cal( 


:. M, 


Proposed compositions 




(avg.) 


(avg.) 


(exact) 










pGEM (Fr. 2) 


1300.8 


1301.1 


1300.4 




3Hex 


3Hep 


IKdo 


pGEM(Fr. I) 


1655.3 


1655.4 


1654.5 




4Hex 


4Hep 


IKdo 


pGEMLOS-7 


2020.3 


2020.8 


2019.7 


IHexNAc 


5Hex 


4Hep 


IKdo 


pGEMLOS-5 


2386.3 


2386.1 


2384.8 


2HexNAc 


6Hex 


4Hep 


IKdo 


pGEMLOS-4 


2386.3 


2386.1 


2384.8 


2HexNAc 


6Hex 


4Hep 


IKdo 


(Fr.2) 














pGEMLOS-4 


2386.1 


2386.1 


2384.8 


2HexNAc 


6Hex 


4Hep 


IKdo 


(Fr.l) 
















2548.4 


2548.2 


2546.8 


2HexNAc 


7Hex 


4Hep 


IKdo 




2589.5 


2589.3 


2587.9 


3HexNAc 


6Hex 


4HeD 


IKdo 



Observed molecular weights are reported as average mass values. 
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Tables. Monosaccharide c mpositions of the 
dephosphorylated oligosaccharide fractions'' 





pGEM 
(Fr.2) 


pGEM 
(Fr.l) 


pGEM- 
LOS-7 


pGEM- 
LOS-5 


pGEM- 
LOS-4 
(Fr.2) 


pGEM- 
LOS-4 
(Fr.l) 


GlcN 






1.0 


2.7 


2.4 


3.3 
















Gal 


0.9 


l.l 


1.7 


3.0 


2.8 


3.1 
















Glc 


2.2 


3.2 


3.2 


3.4 


3.4 


3.4 
















Hep 


3.0 


4.0 


4.0 


4.0 


4.0 


4.0 



" Molar ratios were derived fixnn comparison to the hydrolysate of the S. 
typhimurium Ra oligosaccharide of known composition, and then those values 
were normalized to either 3.0 or 4.0 heptoses in each fraction. 



29 

SUBSTITUTE SHEET (RULE 26) 



wo 00/70060 



PCT/USOO/13538 



Table 6. Methylation analysis of the 
depbosphorylated oligosaccharide fractions" 





pGEM 
(Fr.2) 


pGEM 
(Fr. 1) 


dGEM- 
LOS-7 


pGEM- 
LOS-5 


dGEM- 
LOS-4 
(Fr.2) 


nGFM- 
LOS-4 
(Fr.l) 


T-Glc 


0.5 


0.2 
























T-Gal 


0.6 


0.5 


1.2 


0.9 


0.7 


0.7 
















U-Glc 


0.2 


0.7 


1.0 


0.8 


' 0.8 


0.7 
















1,3-Gal 








0.4 


0.5 


0.6 
















1,6-Glc 


0.4 


1.1 


1.6 


1.5 


1.3 


1.3 
















1,6-Gal 










0.3 


0.6 
















1,3,6-Glc 


1.0 


1.0 


1.0 


1.0 


1.0 


1.0 
















T-Hep 


0.3 


0.6 


0.6 


0.3 


0.3 


0.2 
















1,3-Hep 


0.3 


0.3 


0.3 


0.4 


0.6 


0.4 
















1,7-Hep 






0.4 


0.6 


0.5 


0.6 
















1,3,7-Hep 


0.4 


0.4 


0.2 


0.3 


0.5 


0.5 
















T-GlcNAc 






0.1 


0.2 


0.2 


0.3 
















1,4-GlcNAc 








0.9 


1.1 


1.0 
















1,3-GlcNAc 






0.3 


0.7 


0.7 


1.0 



" Peak areas were measured from the GC/MS EI total ion chromatograms, and 
values were normalized to the 1,3,6-glc residue. The data for pGEMLOS-7 are 
the average of two runs. 
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WHAT IS CLAIMED IS: 

1 . A process for the production of a complex carbohydrate which comprises 
the steps of: 

5 (a) inoculating transformed production cells into a culture medium 

capable of supporting the growth of said production cells wherein said 
production cells are prepared by transforming bacteria comprising (i) a core lipid 
structure containing a terminal heptose molecule and (ii) an enzyme capable of 
adding an acceptor molecule to said heptose molecule by inserting an isolated 
10 DNA sequence encoding glycotransferase synthesizes a complex carbohydrate 
into said bacteria to yield transformed production cells; 

(b) allowing growth of said transformed production cells; 

(c) recovering said complex carbohydrate from the culture mediimi. 

15 2. The transformed production cell of claim 1 . 



3. The transformed production cell of claim 2 comprising a gram-negative 
bacterium having a terminal heptose on a kdo core and having inserted an 
isolated DNA sequence encoding the glycotransferase catalyzing the synthesis of 

20 an oligosaccharide of Haemophilus influenzae. 

4. The transformed production cell of claim 3 comprising Escherichia coli 
K-12 strain JM 109. 

25 5. The process of claim 1 wherein the bacteria are gram-negative bacteria. 

6. The process of claim 1 wherein the bacteria is Escherichia coli K-12 
strain JM 109. 

30 7. The process of claim 1 wherein the acceptor molecule is [N-acetyl] 
galactose. 
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8. The process of claim 1 wherein the isolated DNA sequence encodes a 
functional Haemophilus influenzae glycotransferase. 

9. The process of claim 1 wherein the isolated DNA sequence encodes a 
5 functional Neisseria gonorrhoeae glycotransferase. 

1 0. The complex carbohydrate made according to the process of claim 1 . 

11. A process for the production of an oligosaccharide which comprises the 
10 steps of: 

(a) transforming gram-negative bacteria comprising (i) a core lipid 
structure containing a terminal heptose and (ii) an enzyme that adds a galactose 
molecule to said heptose wherein said transformed gr^-negative bacteria are 
prepared by constructing a vector comprising an isolated DNA sequence coding 
IS for a glycotransferase that synthesizes an oligosaccharide; 

(b) inoculating said transformed gram-negative bacteria into a 

culture medium capable of supporting the growth of said 
transformed bacteria; 

(c) allowing growth of said inoculated gram-negative bacteria; and 
20 (d) recovering said oligosaccharide from the culture medium, 

12. The process of claim 1 1 wherein the transformed bacteria is Escherichia 
coli K-12 transformed with an isolated DNA sequence from Haemophilus 
influenzae. 

25 

13. The ohgosaccharide made by the process of claim 11. 

14. The process of claim 1 1 wherein the transformed bacteria is Escherichia 
coli K-12 transformed with an isolated DNA sequence from Neisseria 

30 gonorrhoeae. 

15. The oligosaccharide made by the process of claim 14. 
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16. A process for the production of a complex carbohydrate, comprising 
culturing production cells comprising a chimeric DNA sequence encoding a 
glycotransferase so as to yield production cells comprising an altered level of 
complex carbohydrate, wherein the production cells are bacteria comprising a 

5 core lipid structure containing a terminal heptose molecule and encoding an 
enzyme capable of adding an acceptor molecule to the heptose molecule. 

1 7. The process of claim 1 6 further comprising recovering the complex 
carbohydrate. 

10 
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(A) Lipooligosaccharide synthesis genes (Isgf) 



BaniHl BgH\ 



[L 



HiniW DrsA 



SpH 



DrsA 



J L 



EcoBI 



PsA 



B 



D 



G (H) 



(B) Transposon inserts 

0 281 .25 

□ 276.4 



(C) Chimeric transfonnants 

pGEMLOS-4 

' pGEMLOS-5 'acZ 
lacZ pGEMLOS-7 



lacZ 

FIG. 1 
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A B C D 




AspGEM 
B=p6EML()S>7 
C^pGEMLDS-S 
()=pGEMlJ0S-4 



FIG. 2 
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E. coli r-LPS Gafal Hepa| 

pGEM 6 7 

Glcal -»3Glca1 -^Hepal -»3Hepa1 -^5Kdo 



Galal Hepal 
i i 
6 7 

>7Hepa1 -»6Glca1 -*2GIca1 -*3Glca1 -»3Hepa1 ^SHepal -»5Kdo 



Chimeric LPS R = 

pGEMLOS-7 |Gal1-»aGlcNAiij 4- 



pGEMLOS-5 



Gal1-»4GlcNAo1-»3Gal1-f36loNAc1 
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